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Abstract. In this article, we define the transport dimension of probabiUty 
measures on M"* using ramified optimal transportation theory. We show that 
the transport dimension of a probability measure is bounded above by the 
Minkowski dimension and below by the Hausdorff dimension of the measure. 
Moreover, we introduce a metric, called "the dimensional distance", on the 
space of probability measures on R'". This metric gives a geometric meaning 
to the transport dimension: with respect to this metric, we show that the 
transport dimension of a probability measure equals to the distance from it to 
any finite atomic probability measure. 



1. Introduction 

The theory of ramified optimal transportation aims at finding an optimal trans- 
port path between two given probability measures. One of the measures is rep- 
resenting the source while the other is representing the target. A transport path 
is typically in the form of a tree-shaped branching structure. A natural question 
is: given a probability measure, is it possible to transport it to a Dirac mass via 
a finite cost transport path? The answer to this question crucially depends on the 
dimensional information of the given measure. 

In [8], Devillanova and Solimini studied the irrigability dimension of measures 
using optimal transportation theory. For any given probability measure /i on M™, 
the irrigability dimension of /i is defined by 

dim/ (u) := inf | : if u is a — irrigable 

o<Q<i [ 1 - a 

The main theorem in [8J shows that 

(1.1) max {dim/f (/i) , 1} < dim/ (/i) < max{l,dimjv/ (m)} 

where dim// (/i) (or dim^/ {^J.)) denotes the infimum of the Hausdorff dimension 
(or the Minkowski dimension, respectively) of sets that /i is concentrated on. By 
definition, the irrigability dimension dim/ (/i) of a measure /i must be larger or 
equal to 1 as the parameter a is in the range of [0, 1). 



In this article, we aim at removing the maximum constraint from ( 1.1 1 by using a 
different approach of ramified optimal transportation (i.e. using optimal transport 
paths), and also allowing the parameter a to be negative. This generalization allows 
us to consider measures which have fractal dimensions (e.g. the Cantor measure) 



2000 Mathematics Subject Classification. Primary 49Q20, 51F99; Secondary 28E05, 90B06. 
Key words and phrases, optimal transport path, ramified optimal transportation, fractal di- 
mension of measures, transport dimension, irrigation dimension. 
This work is supported by an NSF grant DMS-0710714. 



2 



QINGLAN XIA, ANNA VERSHYNINA 



less than 1. Wc introduce an analogous concept called "the transport dimension of 
, and show in theorem |3.5.2| that 

dim^ (fi) < dimy (^) < dimM (m) 

with a slight modification of the definition of dimAf (/i). The major difference 
between dim/ (/i) and dim^ (/i) is that dimy (/i) is allowed to be less than 1. For 
instance, for the Cantor measure, we show that dim^ (/i) is which is exactly 
the dimension of the Cantor set. 

Moreover, we find a geometric meaning to the transport dimension by introduc- 
ing a new metric, called "the dimensional distance", on the space of equivalent 
classes of probability measures. With respect to this metric, the distance of any 
probability measure to a Dirac mass (or any atomic probability measure) equals to 
the transport dimension of the measure. In other words, the transport dimension 
of a measure quantitatively describes how far the measure is from being an atomic 
measure. 

We refer to the book [31 as well as references there for a modern account of opti- 
mal transportation with branching structures. In particular, a partial list of most 
relevant works is listed in the reference: [T],[g,[l],[B],[7],[n],[TD],[IT],[Tl],[Tl],[Tn],[TS], 
[IH],[Ipl and of course [S]. 

Organization of the paper. We first recall some basic concepts about optimal 
transport paths in section 2 with some necessary modifications. We show that for 
any a G (— cx), 1), there exists a metric da on the space A{M.™) of atomic proba- 
bility measures. Then, we consider the metric completion Va (K'") of ^(M™) with 
respect to each da ■ We show that when a probability measure is concentrated on a 
set whose box dimension is less than , then the measure belongs to Va ■ After 
that, we begin to consider the transport dimension of measures in section 3 with 
some comparison with other dimension of measures. In particular, we show that 
the transport dimension of a measure is bounded above by its Minkowski dimen- 
sion and bounded below by its Hausdorff dimension. We also show the transport 
dimension of the Cantor measure is In section 4, we consider the dimensional 
distance between probability measures. The main result there says that the trans- 
port dimension of a probability measure is given by the distance of the measure to 
any atomic probability measure with respect to the dimensional distance. 



2. The da metric on probability measures for a G (-oo, 1) 

2.1. Transport paths between atomic measures. We first recall some basic 
concepts about transport paths between measures of equal total mass as studied in 
|llj . with some necessary modifications. 

Recall that a (finite) atomic measure on E™ is in the form of 

k 

(2.1) a ^ mjSxi 

with distinct points Xi G M™, and positive real numbers to,;, where 5.^. denotes the 
Dirac mass located at the point x. The measure a is a probability measure if its 
mass ||a|| := X^iLi "^i ^ 1- Let ^(M™) be the space of all atomic probability 
measures on K"*. 
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Definition 2.1.1. Given two atomic measures 

k I 
(2.2) a = ^ ] iTT-iSxi andh = ^^^njSy^ 

i=l 3=1 

in W^of equal mass, a transport path from a to b is a weighted directed graph 
G consisting of a vertex set V{G), a directed edge set E{G) and a weight function 
w : E{G) (0, +oo) such that {xi,X2, x^} U {yi,y2, Hi} C V{G) and for any 
vertex v G V{G), 
(2.3) 

{rrii, if V = Xi for some i = 1, k 
—Uj, if V = yj for some j — 1, ...,£ 
0, otherwise 

where and e+ denotes the starting and ending endpoints of each edge e G E{G). 



Note that the balanced equation (2.3 1 simply means the conservation of mass at 
each vertex. In terms of polyhedral chains, we simply have dG — b ~ a. 
For any two atomic measures a and b on M™ of equal total mass, let 

Path{a,h) 

be the space of all transport paths from a to b. It is easy to see to that Path (a, b) 
is always nonempty. 

Definition 2.1.2. For any a < 1 and any transport path G G Path(a.,h) , we define 
M„(G) w{e)''length{e). 

We now consider the following ramified optimal transport problem: 

Problem 1. Given two atomic measures a and h of equal mass on M™ and — oo < 
a < 1, find a minimizer of 

M„(G) 

among all transport paths G G Path {a, h). 

An Mq, minimizer in Path{a,h) is called an a— optimal transport path from a 
to b. 

Note that in pj, Definition 2.2], we only allow that < a < 1. But now, for 
the purpose of studying fractal dimension of measures in this article, we also allow 
negative a. Negative a also corresponds to many transport problems in reality. For 
instance, the "cost" (i.e. risk here) for a baby to go back home from his preschool 
by himself is much higher than the one for an adult. This is why babies need to 
be picked up by adults. In this example, the cost of transporting a higher density 
mass (i.e. an adult) is less than the cost of transporting a lower density mass (i.e. 
a baby). Negative a is used for such phenomenon. Another application of ramified 
optimal transportation with negative a may be found in |17j . 

Definition 2.1.3. For any a < 1, we define 

da (a, b) = inf {M„ (G) : G G Path (a, b)} 
for any a,b Gyi(M™). 
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Remark 2.1.4. If a and b are two atomic measures of equal total mass A, and let 
a = -^a and b = -^b be the normalization of a and b. Then, we set 



(a,b 



A"d„ (a,b) 



2.2. Behavior of transport paths for a negative a. The behavior of an a— optimal 
transport path is quite different for a negative a and a nonnegative a. For in- 
stance, in proposition 2.1], we shown that for nonnegative a, an a-optimal 



transport path contains no cycles. On the hand, from the following example 2.2.1 
an a— optimal transport path may prefer to have a cycle when a is negative. 

Example 2.2.1. Let a = Sq, h =0.8(5o + 0.2(5i e ^(M), then we may construct two 
transport paths from a to b. One contains a cycle while the other does not. 

m 0.2 . ^ 



0.8 

The first transport path G, The 2nd transport path G, 



Figure 1: For a < 0, an optimal transport path may contain a cycle 



The first transport path Gi consists of only one directed edge from Q to \ with a 
weight 0.2. The second transport path G2 consists of two directed edges: one from 
to 1 with a weight 1 and the other from 1 to with a weight 0.8. Then, when 
a <—\, we have 

M^(G'2) ^ l" + 0.8° ^ 0.8° + 0.8° ^ 2i+2a < 1 
M„(Gi) 0.2° - 0.2° 
Thus, Mc {G2) < Mq, (Gi). i.e. a path containing a cycle may have less Mq, cost. 

A transport path may also contain another type of cycles, which are unpleasant 
for our study. 




Figure 2: An unpleasant cycle appeared in a transport path 
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Example 2.2.2. From definition, it is possible for a transport path to contain 
another type of directed cycles which are similar to the one appearing in figure 
Using this directed cycle, the original weight " 0.5" of the edge in the example 
may either be repeatedly counted or be combined with some mass from unknown 
sources (which may have nothing to do with a orh), and lead to a new very high 
weight " n + 0.5" in the end. When a is negative and n is large, (n + 0.5)" may 
become a very small positive number, much less than 0.5". Thus, for negative a, 
one may significantly decrease the cost of a transport path by repeatedly adding 
a directed cycle of this type with a very large weighting constant n. Since the value 
n" approaches zero as n approaches to infinity when a < 0, allowing such kind 
of directed cycles in the family Path (a, b) may eventually lead to an unpleasant 
result: da (a, b) = for a 7^ b. 

The cycle in example |2.2.1| is desirable while the cycle in example |2.2.2| is not what 
we want. To overcome this conflict of interests, we adopt the following convention 
on the definition of transport paths when a is negative. 

Convention 2.2.3. For any vertex v d V (G) of a transport path G, if there exists 
a list of vertices {vi,V2, ■ ■ ■ ,Vn\ such that vi — Vn = v and [vi,Vi+i] is a directed 
edge in E (G) with a positive edge length for each i = 1, 2, • • • ,n — 1, then we will 
view Vn as a different copy of the point v. In other words, f„ and v are two 



different vertices in V (G) and thus the balance equation (2.3) must be separately 
hold at each of them. 

Using this convention, one cannot "combine" the weights of edges from the vertex 
Vn with weights of edges from the different vertex vi. So the weight of an edge 
cannot be repeatedly counted leading to a very large number, and must be bounded 
above by the total mass of a. As a result, the convention imphes two consequences: 
a transport path will no longer contain a directed cycle in the type of example 1 2 . 2 . 2 



Moreover, we have a universal upper bound (i.e. the total mass of the source) on 
the weight of each edge: 

(2.4) «^(e)<||a|| 

for each edge e of a transport path G from a to b. In particular, if a is a probability 
measure, then w [e) < 1. 

For nonnegative a, since an a— optimal transport path contains no cycles (as 
shown in [TTJ proposition 2.1]), there is no need to adopt this convention then. 

Many properties about transport paths have been studied in [H] , [H] , IS] , 
and (3] when < a < 1. For instance, we showed in [11] theorem 5.1] that 
{A{R"^),da) is a geodesic space, and it is indeed a metric induced by a quasimetric 
(see [16j). As the main purpose of this article is studying dimensions of measures, 
we will leave the study of properties of optimal transport paths in the situation of 
a < to a later article. Currently, we only need to show that da is still a metric 
on AiM."") when a < 0. 

2.3. The da metric when a < 0. We denote S{p,r) (and B{p,r), respectively) 
to be the sphere (and the closed ball, respectively) centered at p € K"* of radius 
r > 0. Note that for any transport path G, the restriction of G on any closed ball 
B{p,r(j) gives another transport path G\g(^p j.^y 

Lemma 2.3.1. Suppose a and b are two atomic measures on M™ of equal total 
mass, and G is a transport path from a to b. // the intersection of G S (p, r) as 
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sets is nonempty for almost all r G [0, ro] for some rg > 0, then for any a < 0, we 
have 

M„ (G|b(p,,„)) > A"ro, 
where A is any upper bound of the weights of edges in G|^(-p ,,o)- 
Proof. For any r > 0, let 

{e eE{G) -.enSip^r)^ 0} 

be the family of all edges of G that intersects with the sphere S {p, r) . By assump- 
tion, Er is nonempty for almost all r G [0,ro]. 

Let L be any fixed ray with endpoint p, and let P : M™ ^ i be the projection 
that maps any point x G to the point P (x) £ L with \P (x) — p\ — \x ~ p\. Note 
that for any edge e G E (G), under the projection P, the length of the segment P (e) 
is no more than the length of e. Therefore, 

M„(G|s(p,.„)) = [«^(e)]"length(e) 

e&E{G\B{p,ro)) 

> Yl [w{eriength{P{e)) 

eeE(G\B(p,ro)) 

Y [w{e)T XP{e){r)dr 

eeE{G\B{p,ro)) 

Y [w{e)fdr. 

where in the second equality XA denotes the characteristic function on a set A. So, 
we have shown that 

pro 

(2.5) 



pro 

M„(G|s(p,,„)) > / Y i^i^Td^- 
Now, since w (e) < A for any e G ^ and a < 0, (2.5 1 yields 

pro pro 

M„(G|g(p, .) > / ma.x[w{e)rdr> A"dr = A"ro. 
Jo Jo 

Corollary 2.3.2. Suppose a < and p G M™. Then, for any atomic measure 

k 
i=l 

on M'" with mass ||a|| :— X]i=i f^i > and any G G Path ^a, ||a|| 5^ we have 

M„(G) > llairip-x,! 

/or each i — \,2, ■ ■ ■ ,k. 



□ 



Proof. By (2.4 1, w (e) < ||a|| for any edge e G i? (G). Then, the result follows from 
lemma 2.3.1 by setting tq = maxi<i<fc {\p ~ Xij} and A = ||a||. □ 



The inequality (2.51 also gives a lower bound estimate for positive a. 
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Corollary 2.3.3. Suppose < a < 1. For any a ey4.(M™) in the form of {2.1), 
p G M™ and rg > 0, we have 



(2.6) 



J2 ™^ 

d{p,Xi)>ro 



< 



Proof. Let A = J2d{p xi)>ro ^ transport path from a to 6p. Then, 

for any < r < rQ, we have 

eeE,, 



By the inequahty (2.5 1, since the function / (x) 
< a < 1, we have 



a;" is concave on [0, 1] when 



Ma(G|s(p,,„)) > 



eeE 



> 



iw{e)r dr 



dr > 



X°'dr = A"ro. 



Therefore, we have (2.6 1. 



□ 



Proposition 2.3.4. d^ is a metric on 



Proof. Obviously, we only need to consider the case a < 0. In this case, it is clear 
that da is nonnegative and symmetric. Now, for any a, b,c and any path 

Gi e Path (a, b) and G2 € Path (b, c), let G3 be the disjoint union of the directed 
weighted graphs Gi and G2. That is, 

V (G3) = {V (Gi) \ V{h)) H {V (G2) \ V{h)) H Vih) 

and E (G3) — E (Gi) ]J E (G2), where the symbol ]J denotes the disjoint union of 
sets, and V{h) denotes the vertex set (i.e. the support) of the measure b. Then, 
by using the convention |2.2.3| G3 is a transport path from a to c with 

M„ (G3) = M„ (Gi)+M„(G2). 

Thus, by taking infimum, we have the triangle inequality 

da (a, c) < da (a, b) + da (b, c) . 

Now, we only need to check that if a 7^ b, then dQ,(a, b) > 0. We may as- 
sume that a and b are in the forms of (2.2). If the supports of a and b are 
different, i.e. {2/i,y2,''' ,2/;} 7^ {xi,X2,--- ^xu} as sets, we may assume that 
yi ^ {xi,X2, • • • , Xk}. In this case, we set p = yi and 

To := min {|p — x^j : i = 1, 2, • • • , fc} > 0. 

If {xi,X2, ■ ■ ■ ,Xk} = {j/i, 2/2, • • ■ ,yi} as sets, then we may assume 

k k 

a = rUiSx- and b = riiS^^ 

i=l 1=1 

and mi 7^ jt-i. In this case, we set p — xi and let 

ro := min{|p- : ^ n.,, i = 2, • • • , fc} > 0. 
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In any of these two cases, for any transport path G from a to b, the intersection of 



G with the sphere S {p,r) is nonempty for any < r < rg. By lemma 2.3.1 

M„ (G) > ro 
since w (e) < 1 for any edge e £ E (G) . Therefore, 

da (a, b) > ro > 0. 

□ 

2.3.1. The completion of AiW^) with respect to da. 

Definition 2.3.5. For any a e (— oo, 1], let 7'ct(IR'") be the completion of the metric 
space A(W^) with respect to the metric da- 

We will simply write VaiM"") as Va- 

Note that when a = 1, the metric di is the usual Monge's distance on ^(M™) 
and Vi is the space V of all probability measures on E™. Now, the following lemma 
implies that each element in Va can be viewed as a probability measure on M™. 

Lemma 2.3.6. If 13 < a, then 7'/3(M") C 7'a(M™), and for all pL,v m VpiR"') we 
have dfj{^,v) > da{^J,,v)- 

Proof. Note that for any path G between any two probability measures a and b in 
we have 

Ma{G) = w(e)"length(e) < ^ w(e)'^length(e) = Mp{G). 

eeE(G} eeE{G) 

This is because w{e) < 1 for any e G E{G). Hence 

da{a,h) = inf{M„(G) : G G Path(a,b)} 

< inf{M^j(G) : G € Path(a, b)} = d^(a, b). 

Therefore, any Cauchy sequence {a„} in A{W^) with respect to the metric dp is 
also a Cauchy sequence with respect to da. Thus, Vpi^™') C 7^q(IR™) with 

da{tJ.,t^) < dpifJ.,!^) 

for any in 'P/3(M'"). □ 

As discussed in [8j the Minkowski-Bouligand dimension, also known as box- 
counting dimension of a set A is given by 

dim;,oj.(^) = mf{f3 > : yl can be covered by GpS'^ balls of 
(2.7) radius 5 for all 5 less than 1 and some constant Cp}. 

A positive Borel measure on M™ is said to be concentrated on a Borel set A if 
^(M™ \ ^) = 0. 

Theorem 2.3.7. Suppose ^ is a probability measure concentrated on a subset A of 
M™ with dimi,ox{A) < y~ for some < a < 1, then ^ E Va- 

Proof. Since dimhox{A) < jz^, we may fix a constant /3 such that diaii,ox{A) < (i < 
jzr^. By (2.7l, for every rt G N, the set A can be covered by balls |i3 jtt^ | . 



of radius ^ centering at x]" , where the number Nn of balls of radius ^ is bounded 
above by Cp (^) ^ for some constant Cp. Using Vitah's covering theorem (see for 
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instance, [3 section 1.5.1]) we may assume that the balls |i? (^x'f'\ 57^^^^ | 
disjoint. For each ball B (^x["'\ , let 



are 



be the set of centers of all balls in the preceding cover that intersects with B {x^l^^ i^^- 

Then one can easily check that the cardinality fc^-"-* of is less than ("y)™- 

Using these family of covers, we construct a Cauchy sequence {a„} for [i as 
follows. For each n, let 

1=1 

where the positive number m>f^ = fi{B ^a:;-"\ \ U/i=\ ^ (2;!^"^ ^^)- Then, for 

each n > 1, we may construct a partition of a„ — X^jl^T^ ^i"; with respect to 
a„_i by setting 

A„ 



(«-l) _ Vm^"^^ , ^ 

i=l 

- M ( (s (.;■■', ^) \ u « ^)) n B (4-', ^) 



i=l 

where the number 

i-l 



h=l 

if x^r'^ e and we set TO,-"-* to be zero otherwise. Also, let G" , ^ be the path 
sending a mass to^"^ from the center a:^"-* to every point x^" G ^t^^ ■ Then, 

A„ 

d„(a„_i,a„) < ^M„(G::-1) 



ix<"'-4—>i 



A„ 



Now, by Holder inequality, we have 
da (aji— 1 , a,j) 



\i=l / \i=l 

A„ 



3 /17\™/_ /1A"'^\^ " „ /17 



10 QINGLAN XIA, ANNA VERSHYNINA 

where b — £ (0, 1), because /3 < As a result, {a„} is a Cauchy 

sequence representing /i with respect to the metric da and therefore /i G T'q. □ 

Corollary 2.3.8. Suppose ^ is a probability measure on M™ with a compact sup- 
port. Then ji G Va for any a G {\ — ^,\^ . 

3. Dimension of measures 

In this section, we will study properties of measures that belong to a special 
subset of Va, and then define the transport dimension of measures. 

3.1. rf(j admissible Cauchy sequence. 

Definition 3.1.1. Let {afc}^j^&e a sequence of atomic measures of equal total mass 
in the form of 

afc = ^mfh^^k) 
1=1 

for each k. We say that this sequence is a da — admissible Cauchy sequence if 



a 

n 




Figure 3: An example of a transport path between Uk and a„ . 

for any e > 0, there exists an N such that for all n > k > N there exists a partition 
of 

Nk 
i=l 

with respect to as sums of disjoint atomic measures and a path (see figure^ 
for each i — 1,2, ■ ■ ■ , Nk such that 

Nk 
i=l 
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Also, we denote Gfj = Si=i iJ which is a path from a.k to a„ with M.a{Gj^) < e. 
Each da — admissible Cauchy sequence corresponds to an element in 
Let 

be the set of all probability measures fj, which corresponds to a d^, admissible Cauchy 
sequence of probability measures. For simplicity, we may write I?q(M"') as Va. 

It is easy to see that if for each fc, there is a partition of ak+i = J^i^J^i ^i+i i 
with respect to as sums of disjoint atomic measures and a path G 



k+l,i 



Path (^mf'^S^(k) , a^*^^ for each i = 1,2, Nk s 



. such that 



k=l \i=l 



< +00, 



then {a„} is a da-admissible Cauchy sequence. 

Also, note that if /i, S I?q,(IR™), one automatically has da (a*, i^) < 00. 

Before discussing properties of Pq, , we give some examples of elements of as 
follows. The first example is a strengthening of corollary 1 2.3.8 [ 

Example 3.1.2. Let fi be any probability measure supported on a compact subset 
of M™ . Then /i G whenever a > 1 — 

Let B be a cube in M™ of side length I that contains the support of /i. For each 

r In) 1 ^" 

n, by using dyadic decomposition of B, we get a family of smaller cubes | 
of generation n centered at and of side length where Nn — 2™". Then, set 



For each n, by setting 



*n+l,i 



i=l 



E 



for each i = 1, 2, • • • , 7V„, we get a partition of a„+i = X^i^Ti^^ ^i+i i ^ith respect 
to a„. Also, by transporting the corresponding mass fi ^sj"^^'^ fr 



om a;.-"' to each 



Xj ' ' , we build an obvious path G"_|_j j e Path i ^ I b["''^ 1 5^(„) , aj^'^-^ 



XXX 


X 


X X X 


X 


X X X 


X 


yK yK 


X 
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Now, it is easy (see [TTJ proposition 3.1] for instance) to check that 

oo / Nk \ 
k=l \i=l I 

whenever a > 1 — ^ , and thus [i G Va ■ 
Example 3.1.3. Cantor measure 
Let 

i=l 

where centers of intervals of the z*'* Cantor interval of length By 

transporting the mass ^ to each Xn,i from the center of the interval in the previous 
step, we construct a path G""^ from a„_i to a„. 



Now, ^ a ^ 

2" 



1=1 ^ ^ 



1 \ /I 

3 



Therefore, {a„} forms a da-admissible Cauchy sequence whenever — g — < 1, that is 
whenever > j^rfi which is exactly the fractal dimension of the Cantor set. The 
measure represented by this Cauchy sequence {a„} is called the Cantor measure. It 
is the usual Ti^ [C where C is the Cantor set and s = is its Hausdorff dimension. 
This shows that the Cantor measure is in Va whenever > • Note that a is 

" 1 — o In J 

allowed to be negative here. 

Example 3.1.4. Fat Cantor measure 

A fat Cantor set is constructed in the same way as constructing a Cantor set 
except that an interval of length A is removed from the middle of [0, 1] for A e (0, 1). 
Again, we set 

1 2" 

1=1 

where centers of intervals of the i*'* Fat Cantor interval of length (^^2^) ■ 

By transporting the mass ^ to each Xn.i from the center of the interval in the 
previous step, we construct a path G""^ from a„_i to a„. Then, 

where p = The sequence {a„} forms a rfc-admissible Cauchy sequence when- 
ever 2^~°'p < 1, that is whenever 

1 In 2 In 2 



l~a Inp ln2-ln(l-A) 

which is the fractal dimension of the Fat Cantor set. The measure represented by 
this Cauchy sequence {a„} is called the Fat Cantor measure. This shows that the 
Fat Cantor measure is in I?„ whenever > — 
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Example 3.1.5. Self similar measures 

Let A be any bounded self similar set in the sense that A is the finite union of 
sets for i = 1, • • • ,k with each Ai being a cr— rescale of A. Pick any point x* E A 
as the center of A. Then, each copy of A of generation n has a corresponding 
center x,-"''. Now, we set 



j(n-l) 



By transporting the mass ^ to each xf' from the center of the set A 

in the previous step, we construct a path GJJ"^ from a„_i to a„. Note that 
< cr"^^i, where L is the diameter of the set A. Therefore, 



(n) (n-1) 



1 



L 



Thus, 



^M„(G"-^) is finite 



CT < 1 



1 Infc 
I — a In (7 



Therefore, the sequence {a„} forms an da-admissible Cauchy sequence whenever 



1 Infc 
I — a In cr ' 



which is the fractal dimension of the self similar set A. The measure represented 
by this Cauchy sequence {a„} is called a self similar measure. This shows that a 
self similar measure is in I?„ whenever > 

^ 1 — a 



Infc 
Inc 



3.2. Hausdorff dimension of measures. Let denotes s dimensional Haus- 
dorff measure on M™ for each s > 0. 

Theorem 3.2.1. If n (z Da for some a <1, then jj, is concentrated on a subset A 
o/M" with (A) ^0. 

Proof. Since fi G Da, it is represented by a dc-admissible Cauchy sequence {a^} in 
the form of 

afc =y'm'l''^d^(k). 

i=l 

By the definition 3.1.1 and taking a subsequence of {a^} if necessary, we have that 
for any k and for all n > fc, there exists a partition of 



with respect to a^ and a path Gf^ ^ G Path{m[''^ S^(k) , b.^^]) for each z = 1, 2, • • • , Nk 
such that 

Nk Nk 
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Now, for each fixed k and any i — 1,2, ■ ■ ■ , Nk, the sequence (sl-^] | is also a 

(k) 

da-admissible Cauchy sequence representing a positive Radon measure /i^ of mass 

(k) 

ml '. As a result, for each fixed fc, the measure /Lt can be decomposed as 

E{k) 
A*. 

1=1 

such that each /i^'^'' has mass m^-'^'' for each i and 



i=l 



Case 1: < a < 1. Note that from (2.6 1, we also have the following estimate, 
for any v G and < a < 1, 

v{R"' \ B{x,r))"r < da,{v,5^) for any x € and any r > 0. 

So if r > d„(z/,4)^'", then 

(3.1) v{W"\B{x,r))<d^{v,5^). 

Now let = \ mf and - ^^''^)- Then 

Nk Nk 
i=l i=l 

where a (fc) is the constant 

k 

(3.2) a(fc)= 

r(| + i) 

Moreover, 

\A,) < ^Mf)(M™\i3(a:f\rf))) 

i=l 



i=l 

We set A := IJ,, (fl/o/i ^fe)- Then, for each h, 

^,{M"'\A) < ^,{R"^\mA^)^^^{\J{R"^\Ak)) 

\k>h / k>h 
k>h k>h 

This implies that ^ (M™ \ A) = 0. On the other hand, since 

n^{f]A,)<n^{A,)<^^o, 

k>h 

we have (Clf.^^ Ak) = for all /i e N and thus Ti^ (A) = 0. Thus fi is 

concentrated on a j^^-negligible set A. 
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Case 2: a < 0. We shall denote 



and A, = [^^,B{xT\rr) 



Suppose the atomic measure sS^] is expressed as 



(fc) (n)r 



Then, by corollary 2.3.2 for every x^"'', j — pi, ...Tqi we have 



Jk) _ (n) 



Jk) _ in) 



< rf\ so xf^ e B{xf\rf^). Thus ^(M™ \ Au) = 0. 



Therefore 
Moreover 

i=l i=l 

For A = ^Ak. Then 

fe fe 

Moreover, we have 7iT^(A) < 7iT^(Aj.) < ^^(/z, a^j) ^ as /c ^ oo. Thus /i 
is concentrated on a j-^-negligible set A. □ 

Definition 3.2.2. For any probability measure /i on M™, f/ie Hausdorff dimension 
of p is defined to be 

dimn (p) = inf {dim^ (A) : p {R"'\A) = 0} , 

where dini/f (A) is the Hausdorff dimension of a set A. 

Thus, by Theorem |3. 2. 1| we have 

Corollary 3.2.3. For any a < 1 and any p E V^iR"^), we have 

dim/f(/x) < — . 

1 — a 

Lemma 3.2.4. Let p be the Cantor measure as defined in example \3.2.4\ Then, 
dimn (p) - 1^ 

Proof. Since p is clearly concentrated on the Cantor set whose Hausdorff dimension 
is j^, we have dim// (p) < j^. Therefore, to show dim// (p) — all we need to 
show is 

/ N ln2 
dim// (p) > 
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For this, it is sufficient to show that 

{A) > 1 

whenever /i is concentrated on a set A. Let {Ci} be any collection of sets that 
covers A. We want to show that 

(3.3) E"(^)('^)^^l' 



where a{s) is the constant as given in (3.2 1. Without losing generality, we may 
assume that Ci are closed intervals. Also, if Ci Cj ^ 9 for some i j, then 

{diam{Ci U Cj)Y < {diam{Ci) + diam{Cj)Y < {diam{Ci)Y + {diam{Cj)y 



as the function is concave on [0, 1]. Therefore, to prove the inequality (3.3), we 
may replace Ci and Cj by Ci U Cj. Thus, without losing generality, we may assume 
that {Ci} are disjoint closed intervals Ci — [0^,6^] with the order 

< ai < 5i < a2 < 62 < • • • < On < K < • • • < 1. 
Note that for each i, 

^i{{hi,a,+i)) = 

as ai^i)C]A — and is concentrated on A. Therefore, a„ ((6^, ai+i)) — when 
n is large enough. This implies that the Cantor set is disjoint with (6^, a.i4_i). Thus, 
the Cantor set is also covered by {Ci}. As a result, (UCi) > (Cantor set) = 1. 



This shows the inequahty (3.3) and hence dimjj (^) = s as desired. □ 
3.3. Minkowski dimension of measures. A nested collection 
T={Q'l : i = 1,2,-- - ,iV„ and n = 1,2,---} 

is a collection of Borel subsets of M™ with the following properties: 

(1) for each Q", its diameter 

(3.4) Cicr" < diam {QD < Czfj" 

for some constants C2 > Ci > and some a E (0, 1). 

(2) for any k, I, i,j with I < k, either Q'^ or C Q^.; 

(3) for each Q"^^ there exists exactly one Q" (parent of Q"^^) such that 

(4) for each there exists at least one QJ+^ (child of ) such that g^+^ C 

Each gf is called a cube of generation n in T. If two different cubes g" and g" 
of generation n have the same parent, then they are called brothers to each other. 

A typical example of a nested collection includes collections of standard cubes 
in K"*. That is, let g be a cube in of side length L and fc is a fixed natural 
number. Then, we evenly split Q into fc™ cubes of side length ^. Pick some (or all) 
of these cubes of generation 1 to form a collection Qi. We may then evenly split 
each of the cubes of generation 1 in Qi into cubes of side length ^ to get cubes 
of generation 2. Pick at least one cube of generation 2 from each cube in Qi, we 
get a collection Q2 consisting of cubes of generation 2. Then, we may continue this 
process for each n to get a collection Q„ consisting some cubes of side length . 
The union of all Q„ is clearly a nested collection. 
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Definition 3.3.1. For any nested collection T , we define its Minkowski dimension 
(3.5) diniM (^) := lim j^^^ 

n^oo log(^) 

provided the limit exists, where iV„ is the total number of cubes of generation n. 



By (3.4 1, we have 

diniM (^) < rn 
for any nested collection T consisting Borel subsets of M™. 

Definition 3.3.2. A Radon measure fi is said to be concentrated on a nested col- 
lection J- if for each n, 

Now, we inductively define the centers for cubes in as follows. For any cube 
Q" in J- , if it has more than one child, then one may pick any point p in Q", and 
call it the center of Q". If Q" has only one child, then we pick the center of the 
child as the center of Q". In case that we have an infinite sequence of cubes {Q"} 
such that each cube is the only child of the previous one, then the intersection of 
the closures of these cubes contains at least one point, and we will call this point 
the center of each cube in the sequence. It is easy to see that we have defined a 
center for each cube in by this process. 

Now, for each Q" in JF, let 

be the distance from the center of Q" to the center of its parent Q"^^- Then, by 
definition of centers and (3.4 1, we have 

, , , „, _ J 0, if has no brothers, 

^■^■^> n^^)-<^ < C2a"-i otherwise. 

We now state our key lemma as follows: 

Lemma 3.3.3. Suppose fi is a probability measure concentrated on a nested collec- 
tion T with 

dim^i/ (T) < 

1 — a 

for some a < 1. If for each n, 

(3.7) Y.^'(Q^TliQ:)<c{N„)'~"a" 

for some constant C, then G 
Proof. Let 

(3 -.^ diuiM [T) < 

I — a 



Then, by (3.5 1, when n is large enough, we have 
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Now, we construct a da— admissible Cauchy sequence {a„} for ^ as follows. For 
each n, let 

1=1 

where a;!"-* is the center of Q" and m^"'' = /^(Q") denotes the total mass of on 
Q2- Also, we can construct a partition of a„ with respect to a„_i by grouping 
together masses that are located in centers of children of the same cube. That is, 
for each j = 1, 2, • • • , iV„_i, let 

where the summation is over all i's such that Q" is a child of Q"^^- For each Q", 

we may transport the mass TOj-"'' from the center of its parent to its center, and 
thus construct a path G^^^ from a„_i to a„. Then, by (3.7), 

i=l 

when n is large enough 

for 6 = cri~/3(i-") £ (0, 1). As a result, {a„} is a dc-admissible Cauchy sequence 
representing fj, and thus fi € 'Da- D 

Definition 3.3.4. For any Radon measure /i, we define the Minkowski dimension 
of the measure /i to be 

dimM (m) := inf {dimA/ (J^)} 
where the infimum is over all nested collection T that /i is concentrated on. 

Theorem 3.3.5. Suppose fi is a probability measure with dim7v/(/i) < for some 
< a < 1, then /i G Da- 

Proof. Since dimj\/(/i) < jz^, A' is concentrated on a nested collection J- with 

dimM {^) < z-^ — • 
I — a 

When a > 0, by Holder inequality and ( |3.6[ ) we have 

N„ 
i=\ 

i=l / i=l 

= C2iV^~"(T"-\ because ^/^(Q?) = 1 

1=1 

Thus, by lemma [3. 3. 3[ we have /i e Da. O 
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3.4. Evenly concentrated measures. Now, we aim at achieving a similar result 
as in theorem |3.3.5| for the case a < 0. To do it, we introduce the following 
definition: 

Definition 3.4.1. A Radon measure fi is evenly concentrated on a nested collection 
T if for each cube Qf of generation n in T , either Q" has no brothers or /i (Qf) > 
^ for some constant A > 0. 

Here, Q" has no brothers means that the parent of Qf has only one child, namely 
itself. 

Now, we provide some examples of a measure /i that is evenly concentrated on 
some nested collection. 



0.5 



0.5 




Figure 4: Here, k — 3,m = 2 and h — 7. Boxes in the first row is chosen determi- 
nately, and boxes in the second row is picked randomly. 



Example 3.4.2. Suppose k is a fixed natural number and h is a natural number 
no more than /c™. Let Bq be a fixed box in M™ and we set Bi — {-Bo}- Now, for 
each n > I, and for each B S Bn, we split B evenly into k™' many smaller boxes 
of length size cr" and then pick h smaller boxes from them either determinately or 
randomly. Let Bn+i be the collections of all smaller boxes as picked above. Thus, 
for each n, the cardinality of Bn+i is Nn^i = hNn — h" for each n. Let 

where xb is the center of the box B G Bn. Then, {a„} is a sequence of probability 
measures. For each B G Bn, we transport the mass -j^ from the center xb' of B's 
parent B* G Bn-i to xb. Ln this way, we construct a path Gn^^ from a„_i to a„. 
For each a, 



E 

i=l 



1 



{Nn 



1 
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Thus, {a„} G Va if — < I i.e. > log^, which is the Minkowski dimension of 
the nested collection B = 1J„ S„. Hence, {a„} is a da-Cauchy sequence representing 
a probability measure /i. Moreover, for each j > 0, 



E 



1 



B'dB 



N, 



n+j 



1 



a„ {B) 



Hence, /i is evenly concentrated on the nested collection B with the property that 
fi (B) — for each B £ Bn. 

Example 3.4.3. Now, we modify the previous example as follows. For any B G Bn, 



7cubes 



48cubes 



341 cubes 



2373cubes 




Figure 5: Here, fc = 3, to = 2 and h — 7. We divide every cube into 9 smaller cubes 
and then pick about 7 ones from them with some randomness. For any particular 
cube, if the number of cubes picked at the current stage is less than ( or respectively 
greater than) the desired one (i.e. 7), more (or respectively less) smaller cubes 
inside it will be picked for the next generation. 



suppose 



card{B* eBn+j : B* C B} 
\2 < j^^ < Ai 



whenever j is large enough for some constants < A2 < Ai. Define {a„} , B and fi 
as before. Thus, when j is large enough 



Then, 



Nn+j = card{B* G Bn+j : B* d B} 

BeB„ 

< ^^^^ = ^l^nh^- 



BeB„ 



fi{B) = lima„+j (B) 



card{B* £Br,+, ■■ B* C B] 
lim 



> lim 



A for A = ^ 



Therefore, in this case, /i is still evenly concentrated on a nested collection B. 
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Recall that a nonncgativc Borel measure /x is Ahlfors regular of dimension d if 
there exists a constant C > such that 

whenever < r < diam (spt (/i)) and x lies in the support spt (fj.) of /i. 

Example 3.4.4. // fj, is an Ahlfors regular measure whose support is a box B in 
M™ , then fj. is evenly concentrated on a nested collection. 

Proof. By splitting the box B into smaller boxes of length size cr", we get a collection 
Bn of disjoint boxes. For any B £ Bn, v/e have 

for some d and for some suitable constants a,A>0. Thus, 

where Nn is the cardinality of the collection S„. Therefore, for every B E Bn 

i^(b) a 

where A = -^fi (ji^ > 0. Hence, /i is evenly concentrated on a nested collection. □ 

Definition 3.4.5. For any Radon measure jj., we define 

dim(7 (/Lt) := inf {dimji/ {T)} 
where the infimum is over all nested collection T that [i is evenly concentrated on. 
Obviously, 

dimAf {pi) < dimu (/i) . 
Now, we have the following theorem for any a < 1. 

Theorem 3.4.6. Suppose ^ is a probability measure with dim[/(/i) < for some 
a < I, then fj, € 'Da- 



Proof. Since dimM (z^) < divnu (/i), by theorem 3.3.5 pL g whenever < a < 1. 
Thus, we only need to consider the case that a < 0. Since dim(7(/Lt) < jh^^ 
evenly concentrated on a nested collection T with 

dimjv/ (^) < —- — • 
1 — a 

For each cube Q" in T, if Q" has no brothers, then I (Q") =0. If Q" has more 
than one brother, then 

Ai(Qr)> ^ andZ(Qn <C2a"-i. 



Thus, when a < 0, we always have 

3'M < ( 

Therefore, 
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By lcnima [3.3.3[ we still have fi E T>a when a < 0. □ 
3.5. Comparison of dimensions of measures. 

Definition 3.5.1. For any probability measure /i, we define the transport dimension 
of n to be 

diniT = inf I — : ^ G 



a<i yi ~ a 

Theorem 3.5.2. Let /i be any positive probability measure on M™. Then, the 
following bounds hold 

dim//(/x) < dimT(M) < maa;{dimAf (/^), !}• 
Moreover, we also have 

dini/f (/i) < dim7'(/i) < dim[/(/^). 

Proof. Assume that diniT(/i) > niax{diniM(M), 1} (or dimj/(/z), respectively ). 
Then, we may pick an a < 1 so that 

dimrf/i) > > max{dimji/(/i), 1} (or dim^ (/x) respectively) . 

1 — a 

By Theorem 3.3.5| (or Theorem 3.4.6[ ), /i G I?q,(M™). This is a contradiction to the 

definition of dim-r (/i) . 

On the other hand, by Theorem 3.2.1 dim/f(/i) < whenever ^ £ I?a(IR™). 
Thus, dimij(/x) < dimT{n). □ 

Example 3.5.3. Let fi be the Cantor measure as in Example 3.1.3\ Let Q" denotes 
the I*'' Cantor interval of length gir. Then, clearly /i is evenly concentrated on the 
nested collection C = {Qf : i =,2, ■ ■ ■ , 2"}^-^. Note that 

dimM(C)=lim^.lim^°S(2")-^^2 



n^oo log (3" ) n^oo log (3") In 3 

In 2 
ln3 ■ 



On the other hand, by lemma 3.2.4 dim// (/i) — Therefore, by theorem 



3.5.2 



dimH/.)=|;^. 

4. The Dimensional Distance between probability measures 
For any a < 1, let 

5„ (M") = {A (/i - I/) : A > 0, e V^{W)} 
be a collection of signed measures. Clearly, S^^ (M™) C (M™) if ai < q;2. 
Definition 4.0.4. For any two probability measures /i, v on M™, we define 

D{n,u) inf{— : ^l - v £ S^{W)}. 

a<l \ — a 

Proposition 4.0.5. D{^,v) is a pseudometri^ on the space of probability mea- 
sures on M™ . 



A pseudometric D means that it is nonnegative, symmetric, satisfies the triangle inequality, 
and D (fi, p.) = 0. But D (p, v) = does not imply fj, = v. 
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Proof. It is clear that D is symmetric and nonnegative. Also, for any a < 1, 
II — 11 (z Sa, so /i) — 0. Now we only need to check the triangle inequality: 

(4.1) D{lJ.i,IJ.2) + D{ll2,fl3) > D{ni,fi3), 

for any probability measures /^i, /i2, /is on M™. 

Note that (4.1) is clearly true if either £'(/ii,/i2) or I?(/J.2,M3) is infinity. Thus 

we may assume that both _D(/ii,/i2) and L)(/i2,/i3) are finite. So, /xi — /i2 € 5q.j 
and /i2 ^ /i3 G 5q.2 for some ai, Q!2 < 1- Now, 

/"I - /^3 = (M1 - + (M2 - Ai3) e 5„iax{ai,Q2} (I*™) ■ 

Therefore, 

1 1 1 

D{fil,fi3) < , ^ < + 



1 — max{ai,a2} 1 — ai 1 — q;2 



By taking an infimum over ai and a2 we obtain the triangular inequality (4.1 1. □ 

In general, D is not necessarily a metric. Indeed, for any two atomic probability 
measures a, b , we have a — b ESa (M™) for any a < 1. Thus, D (a, b) = while 
a and b are not necessarily the same measure. Nevertheless, we may easily extend 
the pseudometric D to a metric on equivalent classes of measures. To this end we 
define a notion of the equivalence classes on measures. 

Definition 4.0.6. For any two probability measures fi and v on M™, we say 

^ ^ V if D{^, v) ~ 0. 

The equivalent class of fj, is denoted by [//]. 

For instance, all atomic probability measures are equivalent to each other. 

Proposition 4.0.7. If Hi ^ /i2 and vi ^ v^, then D{[i\.,v\) = D{fi2,i^2)- 

Proof. By definition of equivalence of measures we have D^fii, 112) = and Diyi, V2) - 
0. So by the triangular inequality (4.1 ) we have 

\D{^li,Ui) - D{fl2, V2)\ < D{fii,^l2) + D{vi,U2) = 0. 

□ 

Definition 4.0.8. For any equivalent class [/i] and [i^], define 

From this definition and proposition |4 . . 5| clearly, we have the following theorem. 
Theorem 4.0.9. D is a metric on V (M™) / . 

Definition 4.0.10. The metric D is called the dimensional distance on the space 
V (M™) / ^ of equivalent classes of probability measures on M™. 

We now give a geometric meaning to transport dimension of measures. 

Theorem 4.0.11. For any positive probability measure /i, we have 

dimT(/i) = D([Ai],[a]) = i?(Ai,a) 

where a is any atomic probability measure. 
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Proof. Since a gPq, for any q < 1, wc have 

D{ii,a) = inf {— : /Li-a e ^a} 
a<l 1 — a 

= inf {— *— : n e Va} = dimr (/x) . 
a<l 1 — a 

□ 

This theorem says that the transport dimension of a probabihty measure /x is the 
distance from /x to any atomic measure with respect to the dimensional distance. 
In other words, the dimension information of a measure tells us quantitatively how 
far the measure is from being an atomic measure. 
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